Handwritten Character Recognition Using Multiclass SVM Classification with Hybrid Feature Extraction

Authors

  • Muhammad Naeem Ayyaz
  • Imran Javed
  • Waqar Mahmood
Abstract

In this paper, we describe a hybrid feature extraction technique for offline handwritten character recognition. The proposed technique combines structural, statistical and correlation features. In the first step, it identifies the type and location of elementary strokes in the character. The strokes sought are horizontal, vertical, positive-slant and negative-slant lines, since we observe that the structure of any character can be approximated by a combination of simple straight-line strokes. The strokes are identified by correlating different segments of the character with the chosen elementary shapes. The normalized correlation values at the different segments of the character constitute the correlation features. To make feature extraction more robust, in the second step we add certain structural/statistical features to the correlation features. The added features are based on projections, profiles, invariant moments, endpoints and junction points. This combination results in a 157-variable feature vector for each character, which we find adequate to uniquely represent and identify each character. The handwritten character recognition problem has not previously been addressed in the way our proposed hybrid feature extraction technique deals with it. The extracted feature vector is used during the training phase to build a support vector machine (SVM) classifier. The trained SVM classifier is subsequently used during the testing phase to classify unknown characters. Experiments were performed on handwritten digits and uppercase letters taken from different writers, without any constraint on writing style. The results were compared with some related existing approaches. With the proposed technique, the results show better efficiency in terms of classifier accuracy, memory size and training time than these existing approaches.
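The correlation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the zone size `k`, the single-pixel-wide templates, the uncentered form of the normalized correlation and every function name below are assumptions made for illustration. The structural/statistical features (projections, profiles, invariant moments, endpoints, junction points) and the SVM stage are not shown.

```python
import math

def make_templates(k):
    """Four k-by-k binary stroke templates (hypothetical shapes, not the paper's)."""
    mid = k // 2
    return {
        "horizontal": [[1 if r == mid else 0 for c in range(k)] for r in range(k)],
        "vertical": [[1 if c == mid else 0 for c in range(k)] for r in range(k)],
        # Positive slant runs from bottom-left to top-right.
        "positive_slant": [[1 if r + c == k - 1 else 0 for c in range(k)] for r in range(k)],
        # Negative slant runs from top-left to bottom-right.
        "negative_slant": [[1 if r == c else 0 for c in range(k)] for r in range(k)],
    }

def normalized_correlation(zone, template):
    """Uncentered normalized correlation; lies in [0, 1] for non-negative inputs."""
    dot = sum(z * t for zr, tr in zip(zone, template) for z, t in zip(zr, tr))
    nz = sum(z * z for zr in zone for z in zr)
    nt = sum(t * t for tr in template for t in tr)
    if nz == 0 or nt == 0:
        return 0.0  # an empty zone matches no stroke
    return dot / math.sqrt(nz * nt)

def correlation_features(image, k):
    """Concatenate the four correlation values of every k-by-k zone, row-major."""
    templates = make_templates(k)
    order = ("horizontal", "vertical", "positive_slant", "negative_slant")
    features = []
    for top in range(0, len(image) - k + 1, k):
        for left in range(0, len(image[0]) - k + 1, k):
            zone = [row[left:left + k] for row in image[top:top + k]]
            for name in order:
                features.append(normalized_correlation(zone, templates[name]))
    return features

# Toy 6x6 binary image: a horizontal stroke in the top-left zone
# and a vertical stroke in the bottom-right zone.
image = [
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 1, 0],
]
features = correlation_features(image, 3)
```

With `k = 3` the toy image yields four zones and therefore 16 correlation values. In a full pipeline these values would be concatenated with the structural/statistical features before being passed to the classifier.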

Similar resources

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications, including reading aids for the blind, bank cheques and conversion of handwritten documents into structured text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

A Hybrid Method for Multiclass Classification and Its Application to Handwritten Character Recognition

The support vector machine (SVM) is an effective pattern classification method. However, solving N(N-1)/2 binary classifications in the training phase makes it too costly to use SVM in applications with a high number N of class types. In this paper, we propose a new prototype classification method that can be combined with SVM for pattern recognition. This hybrid method has the following merits...

Zernike Moment Feature Extraction for Handwritten Devanagari (Marathi) Compound Character Recognition

Compound character recognition of Devanagari script is a challenging task, since the characters are complex in structure and can be modified by writing a combination of two or more characters. These compound characters occur 12 to 15% of the time in the Devanagari script. Moment-based techniques have been successfully applied to several image processing problems and represent a fundamental too...

An Investigation on the Performance of Hybrid Features for Feed Forward Neural Network Based English Handwritten Character Recognition System

Optical Character Recognition (OCR) is one of the active subjects of research in the field of pattern recognition. The two main stages in an OCR system are feature extraction and classification. In this paper, a new hybrid feature extraction technique and a neural network classifier are proposed for an off-line handwritten English character recognition system. The hybrid features are obtained by...

Greek Handwritten Character Recognition

In this paper, we present a database and methods for off-line isolated Greek handwritten character recognition. The Computational Intelligence Laboratory (CIL) Database consists of 35,000 isolated and labelled Greek handwritten characters. This database was tested with an existing structural approach for Greek handwritten characters as well as with a novel approach based on a hybrid feature ext...

Publication date: 2012